Picture for Yanmin Qian

Yanmin Qian

Representation-Regularized Convolutional Audio Transformer for Audio Understanding

Add code
Jan 29, 2026
Viaarxiv icon

SLM-SS: Speech Language Model for Generative Speech Separation

Add code
Jan 27, 2026
Viaarxiv icon

DeepASMR: LLM-Based Zero-Shot ASMR Speech Generation for Anyone of Any Voice

Add code
Jan 22, 2026
Viaarxiv icon

ICASSP 2026 URGENT Speech Enhancement Challenge

Add code
Jan 20, 2026
Viaarxiv icon

A Data-Centric Approach to Generalizable Speech Deepfake Detection

Add code
Dec 24, 2025
Figure 1 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Figure 2 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Figure 3 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Figure 4 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Viaarxiv icon

USE: A Unified Model for Universal Sound Separation and Extraction

Add code
Dec 24, 2025
Viaarxiv icon

What Does the Speaker Embedding Encode?

Add code
Dec 20, 2025
Figure 1 for What Does the Speaker Embedding Encode?
Figure 2 for What Does the Speaker Embedding Encode?
Figure 3 for What Does the Speaker Embedding Encode?
Figure 4 for What Does the Speaker Embedding Encode?
Viaarxiv icon

Exploring Self-Supervised Audio Models for Generalized Anomalous Sound Detection

Add code
Aug 17, 2025
Figure 1 for Exploring Self-Supervised Audio Models for Generalized Anomalous Sound Detection
Figure 2 for Exploring Self-Supervised Audio Models for Generalized Anomalous Sound Detection
Figure 3 for Exploring Self-Supervised Audio Models for Generalized Anomalous Sound Detection
Figure 4 for Exploring Self-Supervised Audio Models for Generalized Anomalous Sound Detection
Viaarxiv icon

FISHER: A Foundation Model for Multi-Modal Industrial Signal Comprehensive Representation

Add code
Jul 22, 2025
Viaarxiv icon

From Sharpness to Better Generalization for Speech Deepfake Detection

Add code
Jun 13, 2025
Viaarxiv icon